Visual speech detection using OpenCV

Authors

  • Usman Ghani Khan
  • Sajid Mahmood
  • Mahmood Ahmed
  • Yoshihiko Gotoh
Abstract

Visual information from the human face, such as lip and tongue movements, conveys much of the spoken message and helps in understanding verbal communication. Visual speech detection overcomes some of the persistent problems and inaccuracies that arise when there is background noise. In noisy environments we pay more attention to the lips, which dramatically improves our understanding of what other people are saying. This research focuses on creating a speech detector that works solely on video data. The work is part of the speaker identification problem in videos. We propose speaker identification using visual cues only. Based on visual information, the presence of speech can be detected in video sequences.

Similar resources

Vowel Recognition of Patients after Total Laryngectomy using Mel Frequency Cepstral Coefficients and Mouth Contour

The paper addresses the problem of isolated vowel recognition in patients following total laryngectomy. The visual and acoustic speech modalities were separately incorporated in the machine learning algorithms. The authors used the Mel Frequency Cepstral Coefficients as acoustic descriptors of a speech signal. A lip contour was extracted from a video signal of the speaking faces using OpenCV sof...

Moving Vehicle Detection for Measuring Traffic Count Using OpenCV

The system in this paper is designed and implemented in Visual C++ with Intel's OpenCV video stream processing system to realize real-time automatic vehicle detection and vehicle counting. Expressways, highways and roads are getting overcrowded due to the increase in the number of vehicles. Vehicle detection, tracking, classification and counting is very important for military, civilian and ...

Deep Parameter Optimisation for Face Detection Using the Viola-Jones Algorithm in OpenCV

OpenCV is a commonly used computer vision library containing a wide variety of algorithms for the AI community. This paper uses deep parameter optimisation to investigate improvements to face detection using the Viola-Jones algorithm in OpenCV, allowing a tradeoff between execution time and classification accuracy. Our results show that execution time can be decreased by 48% if a 1.80% classifi...

An Improved ORB Algorithm of Extracting and Matching Features

For feature extraction in image mosaicing, an improved fast algorithm for extracting binary feature points is presented, based on ORB (Oriented FAST and Rotated BRIEF). During detection, a median filter method is used to detect more accurate feature points. Speed is improved by using the ORB algorithm to extract image binary descriptors and using RANSAC algorithm and homography m...

Robust Automatic Traffic Signs Recognition Using Fast Polygonal Approximation of Digital Curves and Neural Network

Traffic Sign Detection and Recognition (TSDR) has many features that help the driver by improving safety and comfort, and today it is widely used in the automotive manufacturing sector. A robust detection and recognition system is a good solution for driver assistance systems: it can warn the driver and control or prohibit certain actions, which significantly increases driving safety and comfort. This pa...

Publication date: 2009